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Abstract 

Determining the precise integrality gap for the subtour LP relaxation of the traveling sales- 
man problem is a significant open question, with little progress made in thirty years in the 
general case of symmetric costs that obey triangle inequality. Boyd and Carr [3] observe that 
we do not even know the worst-case upper bound on the ratio of the optimal 2-matching to the 
subtour LP; they conjecture the ratio is at most 10/9. 

In this paper, we prove the Boyd-Carr conjecture. In the case that a fractional 2-matching 
has no cut edge, we can further prove that an optimal 2-matching is at most 10/9 times the 
Xf\ cost of the fractional 2-matching. 
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1 Introduction 

The traveling salesman problem (TSP) is the most famous problem in discrete optimization. Given 
a set of n cities and the costs c(i,j) of traveling from city i to city j for all i,j, the goal of the 
problem is to find the least expensive tour that visits each city exactly once and returns to its 
starting point. An instance of the TSP is called symmetric if c(i,j) = c(j,i) for all i,j; it is 
asymmetric otherwise. Costs obey the triangle inequality if c(i,j) < c(i,k) + c(k,j) for all i,j,k. 
The TSP is known to be NP-hard, even in the case that instances are symmetric and obey the 
triangle inequality. From now on we consider only these instances unless otherwise stated. 

Because of the NP-hardness of the traveling salesman problem, researchers have considered 
approximation algorithms for the problem. The best approximation algorithm currently known is 
a |-approximation algorithm given by Christofides in 1976 [7j- Better approximation algorithms 
are known for special cases. Exciting progress has been made recently in the case of the graphical 
TSP, in which costs c(i,j) are given by shortest path distances in an unweighted graph; Momke and 
Svensson |14j give a 1.461-approximation algorithm for this case. However, to date, Christofides' 
algorithm has the best known performance guarantee for the general case. 

There is a well-known, natural direction for making progress which has also defied improvement 
for nearly thirty years. The following linear programming relaxation of the traveling salesman 
problem was used by Dantzig, Fulkerson, and Johnson [8] in 1954. For simplicity of notation, we 
let G = (V, E) be a complete undirected graph on n vertices. In the LP relaxation, we have a 
variable x{e) for all e = (i,j) that denotes whether we travel directly between cities i and j on 
our tour. Let c(e) = c(i,j), and let 5(S) denote the set of all edges with exactly one endpoint in 
S C V . Then the relaxation is 

Min > c(e)x(e) 
(SUBT) subject to: ^ x(e) = 2, Mi € V, (1) 

ee<5(i) 

Y^ x(e)>2, VSCV, 3< \S\ < \V\ -3 (2) 

ee<5(S) 

< x(e) < 1, Ve e E. (3) 

The first set of constraints (fTl) are called the degree constraints. The second set of constraints pi) 
are sometimes called subtour elimination constraints or sometimes just subtour constraints, since 
they prevent solutions in which there is a subtour of just the vertices in S. As a result, the linear 
program is sometimes called the subtour LP. It is known that the equality sign in the first set 
of constraints may be replaced by > in case the costs obey the triangle inequality (Goemans and 
Bertsimas |12j : see also Williamson |19j). 

The LP is known to give excellent lower bounds on TSP instances in practice, coming within 
a percent or two of the length of the optimal tour (see, for instance, Johnson and McGeoch |13j). 
However, its theoretical worst-case is not well understood. In 1980, Wolsey [20] showed that 
Christofides' algorithm produces a solution whose value is at most | times the value of the subtour 
LP (also shown later by Shmoys and Williamson [IB]). This proves that the integrality gap of the 
subtour LP is at most |; the integrality gap is the worst-case ratio, taken over all instances of the 
problem, of the value of the optimal tour to the value of the subtour LP, or the ratio of the optimal 
integer solution to the optimal fractional solution. The integrality gap of the LP is known to be 
at least 3 via a specific class of instances. However, no instance is known that has integrality gap 
worse than this, and it has been conjectured for some time that the integrality gap is at most ^ 





Figure 1: Illustration of the worst example known for the ratio of 2-matchings to the subtour LP. 
The figure on the left shows the instance; all edges in the graph have cost 1, all other edges have 
cost 2. The figure in the center gives the subtour LP solution, in which the dotted edges have value 
2, and the solid edges have value 1; this is also an optimal fractional 2-matching. The figure on 
the right gives an optimal 2-matching, which is also the optimal tour. 

(see, for instance, Goemans |llj). The results of Momke and Svensson |14j show that in the case 
of the graphical TSP, the integrality gap is at most 1.461; if the graph is cubic, Boyd, Sitters, van 
der Ster, and Stougie [6] show that the gap is g, and Momke and Svensson extend this bound to 
subcubic graphs as well. 

There is some evidence that the conjecture might be true. Benoit and Boyd [2] have shown via 
computational methods that the conjecture holds for n < 10, and Boyd and Elliot-Magwood [5] 
have extended this to n < 12. In a 1995 paper, Goemans [11] showed that adding any class of 
valid inequalities known at the time to the subtour LP could increase the value of the LP by at 
most „; this is necessary for the conjecture to be true. Somewhat weaker evidence is as follows. 
A 2-matching is an integer solution to the subtour LP obeying only the degree constraints (IT]) 
and the bounds constraints (13]) rl A fractional 2-matching is a 2-matching without the integrality 
constraints. Boyd and Carr j3] have shown that the integrality gap for the 2-matching problem is at 
most |. Furthermore, Boyd and Carr [3] have shown that if the subtour LP solution is half- integral 
(that is, x(i,j) G {0, g, 1} for all i,j £ V) and has a particular structure then there is a tour of 
cost at most | times the value of the subtour LP. 

Not only do we not know the integrality gap of the subtour LP, Boyd and Carr have observed 
that we don't even know the worst-case ratio of the optimal 2-matching to the value of the subtour 
LP, which is surprising because 2-matchings are well understood and well characterized. They make 
the following conjecture. 

Conjecture 1 (Boyd and Carr [3j) The worst-case ratio of an optimal 2-matching to an opti- 
mal solution to the subtour LP is at most -§-. 

It is known that there are cases for which the cost of an optimal 2-matching is at least -^ times the 
optimal solution to the subtour LP; see Figure [JJ Boyd and Carr have shown that the conjecture 
is true if the solution to the subtour LP has a very special structure: namely, all variables x(e) G 
{0, |, 1}, the cycles formed by the edges e with x(e) = \ all have the same odd size k, and the 
support is (k — l)-edge-connectedjj In the general case, the only bound on this ratio that we 
know of is the Boyd and Carr bound on the integrality gap of 2-matchings; since the constraints of 
the subtour LP are a superset of the fractional 2-matching constraints, this implies the ratio is at 
most |. 

The work of Goemans [TTJ has some bearing on this conjecture. He studies the following linear 



1 We note that what we refer to here as 2-matchings, are also sometimes called 2-factors. 

2 In fact, they show in this case the optimal 2-matching has cost at most 3I ^ 1 times the subtour LP. 



program which is essentially same as the subtour LP in the case edge costs obey triangle inequality: 

Min y, c(e)x(e) 

{SUBT') subject to: ^ x ( e ) ^ 2 > VS C U, S / 0, (4) 

ee5(S) 

x(e) > 0, Ve e E. (5) 

Goemans shows (among other things) that adding comb inequalities to this LP can increase the LP 
value by at most -^; more precisely, he shows that if x is a feasible solution to (SUBT'), then -^-x 
is feasible for the LP obtained by adding comb inequalities to (SUBT'). It is known that adding a 
subset of the comb inequalities to the degree constraints (IT]) and bounds pi) gives the 2-matching 
polytope. This would imply the Boyd-Carr conjecture if it were known that there is an optimal 
solution that obeys the degree constraints when the comb inequalities are added to (SUBT'); as 
mentioned above, it can be shown that there is an optimal solution for (SUBT') that obeys the 
degree constraints when the edge costs obey the triangle inequality. But we do not know whether 
there is an optimal solution that obeys the degree constraints if the comb inequalities are added q 

The contribution of this paper is to improve our state of knowledge for the subtour LP by 
proving Conjecture [JJ 

We start by showing that in some cases the cost of an optimal 2-matching is at most -S the 
cost of a fractional 2-matching, which is a stronger statement than Conjecture [TJ in particular, we 
show this is true whenever the support of the fractional 2-matching has no cut edge. The example 
in Figure 1 shows that the ratio can be at least iP in such cases, so this result is tight. As the first 
step in this proof, we give a simplification of the Boyd and Carr result bounding the integrality 
gap for 2-matchings by g. In the case that the support of an optimal fractional 2-matching has 
no cut edge, the proof becomes quite simple. The perfect matching polytope plays a crucial role 
in the proof: we use the matching edges to show us which edges to remove from the solution in 
addition to showing us which edges to add. We note that this idea was independently developed 
in the recent work of Momke and Svensson, but also previously appeared in the reduction of the 
2-matching polytope to the matching polytope; see, for instance, Schrijver (TTJ Section 30.7]. We 
also use a notion from Boyd and Carr |4J of a graphical 2-matching: in a graphical 2-matching, each 
vertex has degree either 2 or 4, each edge has 0, 1, or 2 copies, and each component has size at least 
three. Given the triangle inequality, we can shortcut any graphical 2-matching to a 2-matching of 
no greater cost. 

To obtain our proof of the Boyd-Carr conjecture, we give a polyhedral formulation of the 
graphical 2-matching problem, and use it to prove Conjecture [JJ If x is a feasible solution for 
the subtour LP, then, roughly speaking, we show that -Sx is feasible for the graphical 2-matching 
polytope. Our previous results give us intuition for the precise mapping of variables that we 
need. Using the graphical 2-matching polytope allows us to overcome the issues with the degree 
constraints faced in trying to use Goemans' results. 

All the results above can be made algorithmic and have polynomial-time algorithms, though 
we do not explicitly determine running times. 

We conclude by posing a new conjecture, namely that the worst-case integrality gap is achieved 
for solutions to the subtour LP that are fractional 2-matchings (that is, for instances such that 

3 To quote Goemans [111 p. 348]: "One might wonder whether the worst-case improvements remain unchanged 
when one adds the degree constraints x(S{i}) — 2 for alii G V and restricts one's attention to cost functions satisfying 
the triangle inequality. We believe so but have been unable to prove it. The result would follow immediately if one 
could prove that the degree constraints never affect the value of the relaxation when the cost function satisfies the 
triangle inequality." 



adding the subtour constraints to the degree constraints and the bounds on the variables does not 
change the objective function value). 

In a companion paper, Qian, Schalekamp, Williamson, and van Zuylen |16| show that the proof 
of the Boyd-Carr conjecture can be used to help bound the integrality gap of the subtour LP for 
the 1,2-TSP. They show that the gap is at most -^ ~ 1.3086 < 3. They also give a proof that the 
cost of the optimal 2-matching is at most =§■ times the cost of a fractional 2-matching in the case 
that c(i,j) E {1, 2}, which gives an alternate proof of the Boyd-Carr conjecture in this case. 

Our paper is structured as follows. We introduce basic terms and notation in Section [2] In 
Section |3j we rederive the Boyd-Carr integrality gap for 2-matchings, and show that the gap is at 
most -§■ in the case the fractional 2-matching has no cut edge. In Section 4l we give the polytope 
for graphical 2-matchings and show how to use it to prove the Boyd-Carr conjecture. Finally, we 
close with our new conjecture in Section [5} 

2 Preliminaries 

We will work extensively with fractional 2-matchings; that is, optimal solutions x to the LP {SUBT) 
with only constraints M and (ph. For convenience we will abbreviate "fractional 2-matching" 
by F2M and "2-matching" by 2M. F2Ms have the following well-known structure (attributed to 
Balinski [1]). Each connected component of the support graph (that is, the edges e for which 
x(e) > 0) is either a cycle on at least three vertices with x(e) = 1 for all edges e in the cycle, or 
consists of odd-sized cycles with x(e) = \ for all edges e in the cycle connected by paths of edges e 
with x(e) = 1 for each edge e in the path (the center figure in Figure n] is an example). We call the 
former components integer components and the latter fractional components. Many of our results 
focus on transforming an F2M into a 2M, in which all components are integer. For that reason, we 
will often focus solely on how to transform the fractional components into integer components. We 
then call the edges of fractional components for which x(e) = \ cycle edges and the edges for which 
x(e) = 1 path edges. Note that removing a cycle edge can never disconnect a fractional component. 
If removing a path edge disconnects a fractional component, we call it a cut edge. The associated 
path of the path edge we will call a cut path, since every edge in it will be a cut edge. We will say 
that a fractional 2-matching is connected if it has a single component. 

We will use a concept introduced by Boyd and Carr [1] of a graphical 2-matching (G2M). As 
stated above, in a graphical 2-matching, each vertex has degree either 2 or 4, each edge has 0, 1, or 
2 copies, and each component has size at least three. Given the triangle inequality, we can shortcut 
any G2M to a 2M of no greater cost. Our techniques for transforming an F2M to a 2M actually 
find G2Ms. 

We will often need to find minimum-cost perfect matchings. By a result of Edmonds |9], the 
perfect matching polytope is defined by the following linear program (M): 

Min ^2 c(e)x(e) 
(M) subject to: ^ x(e) = 1, Mi £ V, (6) 

e£<5(i) 

J2 x(e) > 1, MS C V, \S\ odd, (7) 

ee<5(5) 

x(e) > 0, Ve 6 E. (8) 



3 2-matching Integrality Gaps 

In this section, we bound the cost of a G2M in terms of an F2M via combinatorial methods. We 
start by giving a proof of a result of Boyd and Carr [3] that there is a G2M of cost at most 3 the 
cost of an F2M. Our proof is somewhat simpler than theirs, but more importantly, it introduces 
the main ideas that we will need to obtain other results. We then show that if the F2M has no 
cut edges, we can improve the bound from » to j. The main idea of this section is that given an 
F2M, we define a matching problem and compute a perfect matching. The perfect matching tells 
us how to modify the fractional components by either duplicating or removing edges so that we 
obtain a G2M. We then relate the cost of the perfect matching found to the F2M by providing a 
feasible solution to the perfect matching LP (M) . We will need the following result of Naddef and 
Pulleyblank [15] ; we give the proof since we will use some of its ideas later on. 

Lemma 3.1 (Naddef and Pulleyblank |15j) Let G be a cubic, 2- edge- connected graph with 
edge costs c(e) for all e S E. Then there exists a perfect matching in G of cost at most I Xlee-B c ( e )- 

Proof: The main idea is to show that x(e) = g is a feasible solution to the matching polytope (M). 
The lemma then follows from the fact that (M) has integer extreme points. Since G is cubic, \V\ 
must be even, and Y^eeSd) x ( e ) = -*-• Now consider any S C V with \S\ odd. Because G is cubic, it 
must be that \5(S)\ is odd, and since G is 2-edge-connected, \5(S)\ > 2. Therefore \6(S)\ > 3, and 

£ee<5(S) X ( e ) ^ L " 

Theorem 3.2 There exists a G2M of cost at most I times the cost of an F2M if the F2M has no 
cut edge. 

Proof: As described above, it is sufficient to focus on a single fractional component of the F2M. 
Let G be the support graph of this component. 

To find the G2M, we find a minimum-cost perfect matching on the graph G' we obtain by 
replacing each path in G by a single edge, which we will call (at the risk of some confusion) a path 
edge. We set the cost of this edge to be the cost of the path in G, and we set the cost of a cycle edge 
in G' to the negative of the cost of the cycle edge in G. Note that G' is cubic and 2-edge-connected 
because the support graph G of the F2M has no cut edge. 

Given a minimum-cost perfect matching in G', we construct a G2M in G by first including all 
paths from G. If a path edge is in the matching in G' ', we double the path in G. If a cycle edge is 
not in the matching in G' , then we include the cycle edge in the G2M in G, otherwise we omit the 
cycle edge. 

We first show that this indeed defines a G2M: for each vertex, the degree is four if the perfect 
matching contains the path edge incident on the vertex (since in that case, the two cycle edges on 
the vertex cannot be in the perfect matching, and hence both are added to the G2M together with 
two copies of the path) , and it is two otherwise (since one cycle edge is in the perfect matching and 
hence only the other cycle edge and one copy of the path are added to the graphical 2-matching) . 
Note that any connected component indeed has at least three nodes, since for any doubled path, 
we also take the four cycle edges incident on the endpoints. 

We let C denote the sum of the costs of the cycle edges, and P the cost of the paths. Note 
that the cost of the F2M solution is \C + P. The cost of the G2M is equal to the cost of all 
edges in the support graph (P + C) plus the cost of the perfect matching. Because G' is cubic and 



2-edge-connected, we can invoke Lemma 3.1 to show that the perfect matching has cost at most a 



pattern 1 •O'C" ^O'O^ ^0*0^ * 

pattern 2 • <X> OO OO 
pattern 3 *0* *C*0* ^O^O* ^O* 

Figure 2: Illustrations of patterns for £ = 9. 

third the cost of the edges in G", or at most nP -oC. Hence the cost of the G2M is at most 

1 1 4 2 4 / l\ 

P + C+-P--C= -P + -C = -[P + -C) , 
3 3 3 3 3 V 2 J ' 

or at most | the cost of the F2M solution, as claimed. ■ 

The idea of using edges from a perfect matching to decide which edges to include in a matching 
and which edges to remove has also been used recently by Momke and Svensson [14J . 

We now modify the proof of the theorem above so that the result extends to the case in which 
the F2M has cut edges. 

Theorem 3.3 (Boyd and Carr [4]) There exists a G2M of cost at most | times the cost of an 
F2M. 

Proof: As described above, it is sufficient to focus on a single fractional component of the F2M, 
and we let G be the support graph of this component. 

We once again create a new graph G' from G, so that we can later define a matching problem 
in G' . The matching will again show us how to create a G2M in G. We extend the previous 
construction to deal with the case when the support graph has cut paths. We introduce a gadget 
in G' for each cut path in G, which replaces the cut path and its two endpoints. The other paths 
in G are again replaced by single edges in G' of cost equal to the cost of the path. Each cycle edge 
in G is also in G' with cost equal to the negative of its cost in G. 

To introduce the cut-path gadget, we begin by using an idea of Boyd and Carr jlj; namely, that 
we only need to consider three patterns to get an almost feasible graphical 2-matching on the cut 
path, when we allow ourselves to increase the cost by a third compared to the F2M. Suppose the 
cut path has £ edges and £ + 1 nodes, and let k = [£/3\ . We can remove every third edge, double 
the remaining edges to obtain groups of nodes that are 2-edge-connected, where we get k groups 
of three nodes that are G2M components, plus one group of £ — 3k £ {0, 1, 2} nodes. Alternatively, 
we could remove every third edge, starting from the first edge and double the remaining edges, in 
which case the first group has one node, the next k or k — 1 groups have three nodes and the last 
group again has one or two nodes. The final pattern removes every third edge, starting from the 
second edge, so that the first group has two nodes, the next k or k — 1 groups have three nodes, 
and, again, the last group has one or two nodes. Figure [2] illustrates the three patterns for £ = 9. 

To get a G2M that contains a certain pattern, we will ensure that if a group has size less than 
three, the G2M will include the two cycle edges incident on the first node (if the group is at the 
start of the pattern) or last node (if the group is at the end of the pattern) . 

We remark that there is exactly one pattern that starts with a group of size one, two and three, 
and hence two patterns need the G2M to include two cycle edges incident on the first node of the 
cut path. On the other hand, there is also exactly one pattern that ends with a group of size one, 
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pattern 3 



Figure 3: Pattern gadget for I = 9. 

two and three (the length of the cut path determines which of the three patterns ends with a group 
of size three: it is the second pattern if £ (mod 3) = 0, the third pattern if £ (mod 3) = 1 and the 
first pattern if £ (mod 3) = 2), and hence there are also two patterns that need the G2M to include 
the two cycle edges incident on the last node of the cut path. 

We are now ready to define the cut-path gadget. We replace each endpoint of the cut path 
in G by a path of length two in £?'; each of these new edges will have cost 0. Each node on the 
path will be connected to a pattern edge corresponding to one of the three patterns. The middle 
node is connected to the pattern edge corresponding to the pattern which does not need two cycle 
edges incident on the endpoint of the cut path (i.e. the pattern for which the group containing 
the endpoint has size three). We set the cost of a pattern edge to the cost of the edges in the 
corresponding pattern. See Figure [3] for an illustration of the gadget when £ = 9. 

If we replace each cut path in G by a cut-path gadget in G' , once again G' will be a cubic graph. 
It is not hard to check that G' is also 2-edge-connected because we have replaced the cut path in 
G with three pattern edges crossing the cut in G'. 

We argue that there is a minimum-cost perfect matching that uses exactly one edge from each 
cut-path gadget. Note that the fact that we replace only the cut paths in G by a cut gadget in 
G' means that a perfect matching in G' contains an odd number of pattern edges in a gadget. If 
it contains three pattern edges, then we could find a matching of no greater cost by choosing only 
one pattern edge, namely the pattern edge that is not incident on the middle node for the either 
one of its endpoints. Note that we can add two edges of cost that connect the four nodes incident 
on the other two pattern edges, to again have a perfect matching without increasing the cost. 

Now we show how to obtain a G2M in G from the minimum-cost perfect matching in G' . In the 
G2M we include all edges from G that are in paths which are not cut paths, the cycle edges in G 
which are not chosen by the perfect matching, duplicates of edges in paths in G that are chosen by 
the perfect matching, and the edges in a pattern if the corresponding pattern edge is in the perfect 
matching. 

We argue that this set of edges is a G2M in G. Note that if the perfect matching contains only 
the pattern edge incident on the middle node, then the two cycle edges that are adjacent to the 
gadget are also in the matching. Hence the corresponding endpoint in G of the cut path has no 
cycle edges incident on it in the G2M, but since the pattern edge is incident on the middle node, 
the corresponding pattern ensures that the node has degree two and is in a connected component 
of size three. If the perfect matching contains the pattern edge incident on a node other than the 
middle node, then neither of the two cycle edges that are adjacent to the gadget in G' are in the 
perfect matching. Hence the corresponding endpoint of the cut path in G has both of these cycle 
edges incident on it in the G2M, and zero or two edges from the pattern corresponding to the 
chosen pattern edge. Hence the node has degree two or four and it is in a connected component of 
size at least three. 



As before, because G' is cubic and 2-edge-connected, we can apply Lemma 3^ to bound the 
cost of the perfect matching in G' . Let Pi be the cost of the paths in G that are not cut paths, 
and P2 the cost of the cut paths in G, so that the cost of the F2M is Pi + P2 + 5C. Note that the 



cost of the three pattern edges in the gadget corresponding to a cut path sums up to four times the 
cost of the cut path. Thus the total cost of the edges in G' is Pi + 4P 2 — C. By Lemma 3.1 the 



cost of the perfect matching in G' is at most 3 Pi + 3P2 — g-C. The cost of the G2M corresponding 
to the minimum-cost perfect matching is therefore at most 

1 4 1424/1 

Pi + -Pi + -P 2 + C - -C = -P + -C = - P + -C 
33 3 3 3 3 V 2 

as claimed. ■ 

We now show how to use the ideas behind the cut-path gadget to obtain a better G2M if no 
cut paths exist. 

Theorem 3.4 If an F2M has no cut edge, then there exists a G2M of cost at most -§■ times the 
cost of the F2M. 

Proof: Once again we define a new graph G' from the support graph G of a fractional component 
of the optimal F2M. Each cycle edge in G is in G' with cost that is the negative of its cost in G. 
Each path in G and its two endpoints are replaced by the cut-path gadget used in the proof of 



Theorem 3.3. The costs of the pattern edges in G' are slightly different than in the previous proof: 
we subtract the cost of the original path from the cost of each pattern edge in its gadget. In other 
words, the cost of a pattern edge in G' is obtained by adding once the cost of the edges that appear 
twice in the pattern and subtracting the cost of the edges that do not appear in the pattern. Note 
that the sum of the costs of the three pattern edges in G' is equal to the cost of the original path 
in G. Also, note that the sum of the costs of any two pattern edges in G' is nonnegative: an edge 
on the path contributes its cost either positively to one pattern and negatively to the other, or 
positively to both patterns. 

We first argue that there is a minimum-cost perfect matching that chooses either zero or one 
pattern edge in each cut-path gadget. Suppose the perfect matching contains two pattern edges in 
a gadget. Note that on both sides of the gadget these pattern edges must be incident on the middle 
node, otherwise some middle node is not matched. Hence the four endpoints of the two pattern 
edges are connected in G' by two edges of cost zero. By the observation above, the cost of the two 
pattern edges is nonnegative, and so we can remove the two pattern edges from the matching and 
add the two edges of cost zero without increasing the cost of the matching. By the same argument, 
we can handle the case that the perfect matching contains three pattern edges from a gadget by 
choosing the pattern edge that is not incident on the middle node on both sides of the gadget, and 
replacing the other two pattern edges in the matching by the cost zero edges that connect their 
endpoints. 

Therefore, we can assume the perfect matching chooses either zero or one pattern edge in a 
gadget. If it chooses zero pattern edges, then we add the path from G to the G2M. Otherwise, 
the pattern corresponding to the chosen pattern edge is added to the G2M. We also add the cycle 
edges to the G2M corresponding to the cycle edges that are not in the perfect matching. 

By almost the same arguments as before, the solution constructed is indeed a G2M. The only 
case not covered by previous arguments is the case in which zero pattern edges are chosen in G' . 
Then it must be the case that one of the two cycle edges is chosen in G' and the other is not, so 
that one of the two cycle edges is included in the G2M and the other not. Since we include the 
path from G in the G2M if no pattern edges are chosen, the endpoint of the path will have degree 
two. 

To argue about the cost of the minimum-cost perfect matching in G' , we create a feasible 
solution for the matching linear program (M). To do this, for each pattern edge e, we set x(e) = g, 



and for every other edge e', we set x(e') = g. We will show this is a feasible solution in a moment. 
Let P be the cost of the path edges in the F2M, and C the cost of the cycle edges, so that the F2M 
has cost P + \C. Since the sum of the cost of the pattern edges in a gadget is equal to the cost of 
the path, the cost of this solution for (M) is |P — gC, and there exists a perfect matching of cost 
at most this much. Thus the cost of the G2M is at most 

P+ i P + c _^ = W p+ 5 10/ 1 

9 9 99 9 V 2 

as claimed. 

To see that x is a feasible solution for (M), consider any cut such that the number of nodes 
on each side of the cut is odd. If there exists a cycle from the F2M such that not all nodes in the 
gadgets for the nodes in the cycle are on the same side of the cut, then there are two edges crossing 
the cut with value |. Since G' is cubic, if the cut has odd size, then the total number of edges 
crossing the cut is odd, and there must be at least one more edge in the cut with value at least g. 
Hence the total value on the edges crossing the cut is at least one. For any other cut, since there is 
no cut path in G, there are at least three gadgets crossing the cut in G' . Since each gadget contains 
three pattern edges, the value of the edges crossing the cut is again at least one. ■ 



4 A Polyhedral Proof of the Boyd-Carr Conjecture 

We will generalize the result in Theorem |3.4| and show that the ratio between the cost of the optimal 
2-matching and the subtour LP is at most -§■. In the combinatorial proofs of the previous section, we 
heavily used the fact that F2Ms have a nice simple structure, and, unfortunately, this does not hold 
for the subtour LP solution. We therefore turn to a polyhedral rather than a combinatorial proof. 
We derive a polyhedral description for graphical 2-matchings, and we then use this description 
to construct a feasible (fractional) G2M solution from any solution to the subtour LP of cost not 
more than =§■ times the value of the subtour LP. The manner in which the feasible G2M solution 



is defined based on a solution to (SUBT) is a generalization of the proof of Theorem 3.4 

We start by giving a polyhedral description of a generalization of 2-matching, where the node 
set consists of "mandatory nodes" (Vman) and "optional nodes" (V op t)- The former need to have 
degree 2 in the solution, whereas the latter can have degree or 2. We will refer to this problem 
as the 2-Matching with Optional Nodes Problem (2MO). 

Theorem 4.1 Let G = (Vman U V op t,E) be a 2MO instance. The convex hull of integer 2MO 
solutions is given by the following polytope: 

Y, y( e ) = 2 , Vi e V man , (9) 

Y, y( e ) ^ 2 > Vi G y ° P t> (io) 

Y V(e) + ^(1 - y(e)) > 1, V5C7,FC 5(S), F matching, \F\ odd, (11) 

ee<5(5)\F eeF 

< y(e) < 1, Ve e E. (12) 



The proof of Theorem 4.1 is similar to the proof of the polyhedral description of the 2-matching 



polytope (Theorem 30.8) in Schrijver flTJ, and is deferred to Appendix [A} 



Recall the definition of a graphical 2-matching (G2M): (i) each vertex has degree either 2 or 4, 
(ii) each edge has 0, 1, or 2 copies, and (iii) each component has size at least three. We will (for 
the moment) relax the second condition so that each edge has at most 3 copies. 

Lemma 4.2 We can reduce a G2M instance G = (V,E) to a 2MO instance G' = (V',E f ) as 
follows: Let V^ an = {i m : i G V},V^ pt = {i : i G V},V = V^ n U V^ pt ,E' = {{i m ,j m ) : (i,j) G 
E} U {(i m ,j ) : (i,j) G E}. We add an edge {i,j} to the (relaxed) G2M solution for each edge 
(im,jm), (io,jm) and (i m , j ) that is in the associated 2MO solution. 

Proof: Note that condition (i) for node i directly follows from the degree constraints for nodes 
i m and i in the reduction. Relaxed condition (ii) follows from the fact that for every edge in the 
G2M instance there are three associated edges in the 2MO instance. Finally, since each node i m 
has degree 2 in the 2MO solution, there cannot be a component of size 1. Suppose there there is a 
component of size 2. Then this must be an isolated doubled or quadrupled edge, say (i, j), because 
of the degree constraints. Clearly we can't have a quadrupled edge since there are at most three 
copies of edge (i, j) in the 2MO solution. We also can't have an isolated doubled edge: in order for 
the edge to be isolated, we would need (im,jm) and (i m ,j ) to be in the 2MO solution. But then 
j must have degree 2, and its second edge must be (j , k m ) for some k ^ i,j, since there are no 
edges (i ,jo) or (jo,jm) in the 2MO instance. ■ 

If the edges have nonnegative costs, we may assume with loss of generality that each edge 
appears at most twice in an optimal G2M solution: if any edge appears three times, we can remove 
two copies of it without affecting the parity of its endpoints, and the cost cannot increase. 

We will now use a solution to the subtour LP on G = (V, E) to define a feasible solution to 
the 2MO instance G' = (V', E') associated with the graphical 2-matching problem on G. It will 
be instructive to first consider the case when the subtour LP solution x is an F2M with no cut 



edge. In that case, the proof of Theorem 3.4 gives us a way to construct a G2M solution. In fact, 



it allows us to find a probability distribution on G2Ms, such that the expected cost of the G2M 
is exactly -^ times the cost of the F2M solution. This probability distribution has a number of 
special properties: (i) if a G2M has positive probability, then each doubled edge is a path edge 
with x- value 1, and has exactly one endpoint that has degree 4; (ii) for each path edge (i,j) with 
x- value 1, the probability that it occurs twice and i has degree 4 is g, and the expected number of 
times (i,j) occurs is ^P. These observations give a hint as to how we should define a 2MO solution 
based on a subtour LP solution x. We think of the edge (i m ,j m ) as the first copy of the edge (i, j), 
and (i m ,j ) as the second copy if j has degree 4, and (i ,jm) as the second copy if i has degree 4. 
Then the probability of (i m ,jo) and (i ,jm) is \x{i,j) if x(i,j) = 1, and the probability of (i m ,jm) 
is ^x(i,j). This interpretation does not quite work for the cycle edges (i, j) with x-value ^, since 
at most one copy occurs in the G2M. 

A better interpretation is that we consider a Eulerian walk on each component of the G2M 
solution, and associate i m with the first time we enter and leave node i, and i with the second 
time we enter and leave node i (if i has degree 4). If we direct the walk in each of the two 
possible directions with probability ^, then the probability we use edge (i m ,jrn) is §x(i, j) and the 
probability we use edge (i m ,jo) is hx(i,j). We argue this as follows. 

For a path edge (i, j) with x(i,j) = 1, the probability that we use edge (i m ,jo) is g, since if j 
has degree 4, we know by the construction that (i, j) is a doubled edge, and i has degree 2. Hence, 
if j has degree 4, then (im,jo) is in the walk, and the probability that j has degree 4 is i. A similar 
argument shows that we use (i ,jm) with probability |. Also, the expected number of times we 
use edge (i, j) in the G2M is ^, so the probability of using (i m ,j m ) in the walk must be |. 
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For a cycle edge (i, j) with x(i,j) = 5, the probability that we use (i m ,j ) is h ' 3> since if j 
has degree 4, then the G2M contains a doubled path edge (j, k) where x(j, k) = 1 and /c has degree 
2. Hence the probability that we use (i m ,jo) is the probability that the walk is directed in such 
a way that we visit i before the loop from j to k and back, and this happens with probability -^. 
Similarly, the probability that we use (i ,jm) is yg, and the fact that the expected number of times 
we use an edge with x-value | in the G2M is |, shows that the probability of using (i m ,jm) in the 
walk must be |. 

The following lemma states that using the probabilities ^x(i, j) and gx(i,j) to define a fractional 
solution to the 2MO instance corresponding to the G2M instance G also yields a feasible solution 
if, rather than an F2M with no cut edge, x is a feasible solution to the subtour LP on G. 

Lemma 4.3 Given a graph G = (V, E), let x be a feasible solution to the subtour LP for G. Then 
the following solution is a feasible solution to the 2MO instance G' = (V',E') associated with the 
graphical 2-matching instance given by G for a = g : 

y(i m ,jm) = (1 -a)x(i,j) 
y(i m ,jo) = ax(i,j) 
y(io,jm) = ax(i,j) 

for all (i,j) £ E. 

Note that the cost of the constructed G2M solution is exactly -S times the cost of the solution 
of the subtour LP. Thus our result follows immediately from the lemma. 

Corollary 4.4 There exists a G2M of cost at most -S times the value of the subtour LP. 



Proof of Lemma 4.3 We need to show that y satisfies the constraints (|9|)-(12) on G' , where G' is 



defined as in Lemma 4.2 Constraints ([9]), (10) and (12) are obviously met, and we only need to 



show that constraints (11) are met. To this end, fix S C V' , F C 5(S) where F is a matching and 
\F\ is odd. We define z(e') = y(e') if e' £ 6(S)\F and z(e') = 1 — y(e') if e' £ F. For simplicity, for 
any set of edges X C E' , we define z{X) = J2e'ex z ( e ')- Then we need to show that z(S(S)) > 1. 

First, suppose S does not contain any node i m for any i £ V. For any j £ S, we have that 
z(5(S) PI 0~(j o )) = z({(i m ,j ) : i G V}). Since \F\ > 1, there exists some j £ S such that F 
contains some edge incident on j , say (i' m ,j ). Then, z({(i m ,j ) : i £ V}) = 1 — ax(i',j) + 
^2i & Vi^i' ax (hJ) = ax ($(j)) + 1 — 2ax(i',j). Now, note that x(5(j)) = 2 and x(i',j) < 1, hence 
z(8(S) n 5(j )) > 1. 

By symmetry, it remains to consider the case when both S and V'\S contain a node i m for 
some i £ V. 

We consider an edge e = (i, j) £ G such that at least one of the three edges (i , j m ), (j m , i m ), (i m ,j ) 
crosses the cut S in G' . Note that there are 2 3 — 1 = 7 possible choices for the edges that cross the 
cut. We discern five different types of edges in G for which at least one of the three corresponding 
edges crosses the cut (type II and type V each cover 2 of the possible choices) : 

(I) The edge (i m ,jm) crosses the cut. 
(II) The edges (i ,j m ) and (j m ,i m ) or the edges (j m ,i m ) and (i m ,j ) cross the cut. 

(III) The edges (i ,jm), (jm,i m ) and (i m ,j ) cross the cut. 

(IV) The edges (i ,j m ), (im,jo) cross the cut. 

(V) The edge (i ,jm) or the edge (i m ,j ) crosses the cut. 
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(d) Type IV. 



(e) Type V. 



Figure 4: Illustrations of the five types of cuts of the edges in the reduction. The y- value on the 
top and bottom edge is ax(i,j) and the y- value on the middle edge is (1 — a)x(i,j). 

Figure [4] illustrates the five types. 

We use the notation i* to denote either i m or i , and we will say an edge e' = (i*,j*) £ G' is in 
a gadget of type I, II, ... , V, if the edge (i,j) £ G is an edge of that type. 

We now consider three different cases, depending on the set F. 

Claim 4.5 If F contains an edge in a gadget of type IV or V, then z(5(S)) > 1. 

Proof: Let e' £ F be contained in a gadget of type IV or V. Note that e' has one endpoint in V^ an 
and one endpoint in V^ pt . Let e' = (i ,jm)- Since (j m ,i m ) does not cross the cut, i and i m are on 
different sides of the cut. 

Hence, the paths {(i ,j m ), (im; *m)l cross the cut S for every j' £ V. Each of these paths thus 
contribute at least ax(i,j') to z(5(S)) for j' ^ j. Also, since e' = (i ,jm) £ F, z ( e> ) = 1 ~~ ax(i,j). 
We thus get that z(8(S)) > Ylj'^j ax (hj') + 1 — (xx(i,j) = ^ ■/ ax(i,f) + 1 — 2ax(i,j) > 1, where 
the last inequality follows since ]f\./ x(i,j') = 2 by the degree constraints, and x(i,j) < 1. ■ 

For the remaining cases, we associate a cut R in the graph G with the cut S in G": let R = 
{i £ V : i m £ 5 1 } • Note that -R, V\i? are not empty. Note that if e is of type I, II, or III, then the 
edge (im,jm) crosses the cut, and hence, the edge e crosses the cut R in G. 

In the remainder of this proof, we will write z(5(S)) = y(5(S)) + \F\ — 2y(F), and we will give 
a lower bound on y(5(S)) to show that z(5(S)) > 1. In order to give a lower bound on y(5(S)), we 
need to use the fact that x satisfies degree constraints for each node, and that x(5(R)) > 2. It will 
therefore be convenient to relate the contribution to y(S(S)) of the three edges (i ,jm), (jm,im)i 
and (i m ,j ) to the edge (i,j) £ G, if (i,j) £ S(R), but also to the nodes i and j for certain types 
of nodes i,j £ V. 

In particular, we say a node i £ V is a lonely node if |{i m ,i } n 5| = 1. We let L be the set 
of lonely nodes. We assign each lonely node i an amount of ax(i,j), for each edge (i,j) of type I, 
II, . . . , V. Note that for each lonely node i, the paths {(i Q , j m ), (jm,im)} cross the cut for all j £ V, 
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and hence, each lonely node gets assigned a^- x(i,j), which by the degree constraints is equal to 
2a. 

(I) For an edge (i,j) of type I, the total contribution of the three edges (i ,j m ), (jm, im), (im,jo) 
to y(5(S)) is (1 — a)x(i,j). Note that both i and j are lonely nodes. We assign (1 — 3a)x(i,j) 
to the edge (i,j), and ax(i,j) each to nodes i and j. 
(II) For an edge (i,j) of type II, the total contribution of the three edges (i ,jm), (jm,im), (im,jo) 
to y(5(S)) is x(i,j). Note that only one of i,j is a lonely node, and we therefore assign 
(1 — a)x(i, j) to the edge (i,j), and ax(i,j) to the lonely node among i,j. 

(III) For an edge (i,j) of type III, the total contribution of the three edges (i ,jm), (jm, im), (im,jo) 
to y(5(S)) is (l+a)x(i, j), and neither i nor j is a lonely node. We therefore assign (l+a)x(i, j) 
to the edge (i, j). 

(IV) For an edge (i, j) of type IV, the total contribution of the three edges (i ,jm), (jm, im), (im,jo) 
to y(5(S)) is 2ax(i,j). Since (i,j) o~(R) and both i and j are lonely nodes, we assign to 
(i,j) and ax(i,j) each to i and j. 

(V) For an edge (i,j) of type V, the total contribution of the three edges (i ,jm), (jm,im), (im,jo) 
to y(5(S)) is ax(i,j). Since (i, j) 5(R) and only one of i and j is a lonely node, we can 
assign to (i,j) and ax(i,j) to the lonely node. 

By the argument above, we have assigned 2a to each lonely node. We now show how this fact, 
combined with the fact that x(5(R)) > 2 and the assignment of values to the edges in 5(R), allows 
us to conclude that z(8(S)) > 1. 

Claim 4.6 // |F| = I, then z(5(S)) > 1. 

Proof: Let F = {e'}. Let (i,j) be such that e' = (i*,j*). We will show that z(5(S)) = y(5(S)) + l- 
2y(e') > 1. Note that 2y(e') < 2(l-a)x(i,j) < 2 -2a, so it is enough to show that y(5(S)) > 2-2a. 

First, suppose that \L\ < 1. Then, there is no edge of type I, so to each edge e G o~(R), we 
assigned at least (1 — a)x(e). Hence, y(5(S)) > (1 — a)x(5(R)) > 2 — 2a, since x(5(R)) > 2 by the 
subtour elimination constraints. 

If \L\ > 2, then we assigned 2a to each node in L, giving at least 4a. We assigned at least 
(1 - 3a)x(e) to each edge e e 5(R). Therefore, y(5(S)) > 4a + (1 - 3a)x(S(R)) > 2 - 2a, where 
we again use that x(5(R)) > 2. ■ 

Claim 4.7 // \F\ > 3, then z(S(S)) > 1. 



Proof: By Claim 4.5, we may assume that all edges in F are contained in a gadget of type I, II or 
III, and hence, that the corresponding edges in e S G are in 5(R). Let E\,E2,E% be the edges in 
S(R) of type I, II and III, respectively, for which the gadget contains one or more edges in F. 

Note that a lonely node i can be incident on at most one edge in E\ U Ei U E3 : Only the edges 
(i,j) G E1UE2 can be incident on a lonely node i, and in the first case, (i m ,jm) must be in F, and 
in the second case, either (i m ,j ) or (i m ,jm) is in F, since these are the only edges that cross the 
cut for these types. Now, since F is a matching, it can have at most one edge incident on i m and 
hence i can be incident on at most one edge in EiL) E2U E3. 

We therefore have that 

y(S(S)) > (1 - 3a)x(E 1 ) + Aa\Ei\ + (1 - a)x(E 2 ) + 2a\E 2 \ + (1 + a)x(E 3 ). 
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On the other hand, since F is a matching, only the gadgets for edges of type III can contain 
two edges in F. Hence, \F\ = \E±\ + I-E2I + (1 + /3)|i?3|, where (3 is the fraction of edges in £3 for 
which two edges in the corresponding gadget are contained in F. 

Also, y(F) < (1 — a) (x(Ei) + x(E 2 ) + x(E 3 )), since y{(i*, j*)) < (l—a)x(i,j), and, if two edges 
in the gadget for e G E 3 are contained in F, then these edges both have y- value ax(e), and since 
a < g, 2ax(e) < (1 — a)x(e). 

Hence, we get that 

*(<J(S)) = y(5(S)) + \F\-2y(F) 

> (1 + 4a) |Si I + (-1 - a)x{E x ) + (1 + 2a)|£ 2 | + (-1 + a)x{E 2 ) 
+\E 3 \ + (-1 + 3a)x(E 3 ) + f3\E 3 \ 

> 3a(|£i| + |£ 2 | + l^sl) + /3|^ 3 | > 3a|F|, 

where the penultimate inequality follows from the fact that x(Ek) < \Ef.\ and a < n, and the last 



inequality from the fact that a < o. Hence, if we choose a = i, then z(<5(5')) > 1. 



5 Conjectures and Conclusions 

/ conjecture that there is no [polynomial-time] algorithm for the traveling salesman prob- 
lem. My reasons are the same as for any mathematical conjecture: (1) It is a legitimate 
mathematical possibility, and (2) I do not know. 

- Edmonds 110)1 

We conclude our paper with a conjecture. We do so in the spirit of Jack Edmonds, quoted 
above; we do not know whether the conjecture is true or not, but we think that even a proof 
that this conjecture is false would be interesting. Our conjecture says that the integrality gap (or 
worst-case ratio) of the subtour LP is obtained for specific kinds of vertices of the subtour polytope; 
namely, ones in which the subtour LP solution has no subtour constraint as part of the dual basis, 
or, restated a different way, for costs c such that an optimal subtour LP solution for c is the same 
as an optimal fractional 2-matching for c. Let us call such costs c fractional 2-matching costs for 
the subtour LP. Note that for such solutions of the subtour LP, the fractional 2-matching will have 
no cut edge. 

Conjecture 2 The integrality gap for the subtour LP is attained for a fractional 2-matching cost 
for the subtour LP. 

We could make a similar conjecture for the ratio of the cost of the optimal 2-matching to the 



subtour LP, but by Theorem 3.4 and Corollary 4.4 we already know that the conjecture is true. 



However, its truth does not shed any light on the conjecture above. 

In a companion paper, Qian et al. [16J show that if an analogous conjecture for edge costs 
c(i,j) G {1, 2} is true, then the integrality gap for 1,2-TSP is at most g. They conjecture that the 
integrality gap for the 1,2-TSP is at most -^; it is known that it can be no smaller than -^. It 
would be nice to show that if the analogous conjecture is true then the integrality gap for 1,2-TSP 
is at most ^. 

Interestingly, we appear to know almost nothing about the consequences of Conjecture [2] Even 
for this very restricted set of cost functions, we do not know a better upper bound on the integrality 
gap of the subtour LP other than the bound of | . Note that the lower bound of 3 is attained for a 
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fractional 2-matching cost. It would be very interesting to prove that for such costs the integrality 
gap is indeed |. Boyd and Carr [3] have shown this for some fractional 2-matching costs in which 
all the cycles of the fractional 2-matching have size 3; this result also follows from the technique of 



Theorem 3.2, since the resulting graphical 2-matching is Eulerian if all cycles have size 3 and the 
fractional 2-matching has a single component (the graphical 2-matching may not be connected if 
there are cycles of size 5). 
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A Polyhedral description of 2MO 

We repeat Theorem |4.1| for sake of completeness. 



Theorem A.l Let G = (Vman U V Q pt,E) be a 2MO instance. The convex hull of integer 2MO 
solutions is given by the following polytope: 

^2 X ( e ) = 2 > yi G ^an ( 13 ) 

e£(5(j) 

Y, < e ) < 2 > v * e ^>pt (14) 

Y x(e) + 5^(1 - x(e)) > 1, VS C V, F C 5{S), F matching, \F\ odd, (15) 

ee<5(S)\F eeF 

0<x(e)<l, VeeE. (16) 
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Proof: The proof that we present here is similar to the proof of the polyhedral description of the 
2-matching polytope (Theorem 30.8) in Schrijver |17j . We will first show that any 2MO solution 
is contained in the polytope, and next show that the extreme points of the polytope coincide with 
the 2MO solutions. 



Constraints (13), (14) and (16) obviously hold for a 2MO solution. To show that constraint (15) 



is satisfied, we consider two cases: (case 1) There is a e E F with x(e) = 0. This makes the left 



hand side of constraint (15) at least 1, since x(e) > for all e. (case 2) x(e) = 1 for all e E F. Since 
\F\ is odd, and each node is incident to an even number of edges in an 2MO solution, it follows 
that there has to be an edge in the solution in 5(S) that is not in F. So the constraint also holds 
in this case. 

The polytope thus contains all 2MO solutions. We will now show that its extreme points 
coincide with 2MO solutions, by reducing 2MO instances to matching instances, for which perfect 
matchings correspond to 2MO solutions. We will show that any feasible point in the 2MO polytope 
corresponds to a feasible point in the perfect matching polytope. Because any point in the perfect 
matching polytope can be written as a convex combination of perfect matchings this implies that 
any point in the 2MO polytope can be written as a convex combination of 2MO solutions, and 
therefore all extreme points of the 2MO polytope correspond to 2MO solutions. 

Before we consider the reduction to perfect matchings, we will first show that adding con- 



straint (15) for all F C E of odd cardinality does not change the 2MO polytope. These additional 
constraints will be convenient when showing that a feasible point in the 2MO polytope is in the 
perfect matching polytope. 

We prove this by induction on \F\. Consider S and F C S(S) so that F is not a matching, i.e. 
\F n 5(i)\ > 2 for some i E V. We consider three cases. 

• (Case 1) |jPn«5(*)| > 3. Then 

E x(e) + ^(l-x(e))>^(l-x( e ))> £ (l-*(e))>3- E x ^ 

ee<5(5)\F eSF eSF eSFn<5(i) eeFn<5(«) 

> 3- E x ( e ) > 3-2 > 1. 

eed(i) 

• (Case 2) \F n 8(i)\ = 2 and i E S. Let F' = F \ 6(i) and let S' = S \ {i}. Then 

E x(e) + E(l - x(e)) 

ee8(S)\F e€F 

> E x(e)-E^)+ E x(e)+Y(l-x(e))+ E U " ^)) 

e£S(S')\F' eeS(i) eS5(i)nF eeF' ee<5(i)nF 

= E a^+EC 1 -^))- E x ( e ) + 2 - 

ee5{S')\F' e&F' ee5(i) 

By induction and the degree bound for i, this quantity is at least 1. 



• 



(Case 3) \F D 5(i)\ = 2 and i g" S. Let F' = F \ 5(i) as in the previous case, but now let 
S' = S U {i}. Then the exact same string of inequalities as in the previous case holds. 



We now use the usual reduction from 2-matchings to matchings (see Theorem 30.7 in Schrijver, 
the notation of which we will also follow): for each node i in the 2MO, there will be two nodes 
in the matching instance: i' and i" . For each edge e = (i,j) in the 2MO instance, there will be 
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Figure 5: Illustration of the reduction from 2MO to matchings. The part of the matching instance 
is drawn which corresponds to an edge between a mandatory node i, and an optional node j. 



two nodes and five edges in the matching instance: nodes p e ,i and p e j, and edges (i',Pe,i), (i",Pe,i), 
(Pe,i,Pe,j), (j'iPej), and (j",p e ,j)- The only difference between the reduction from 2-matchings to 
matchings, and the reduction from 2MO to matchings is that for optional nodes we also add an 
edge between nodes %' and i". An illustration of the reduction is given in Figure [5| where the part 
of the matching instance is given which corresponds to an edge between a mandatory node i, and 
an optional node j. 

Given a (fractional) solution a; to a 2MO instance, we define a solution y to the corresponding 
matching instance as follows: 



y(i',p e ,i) = y(i",p e ,i] 

y(Pe,i,Pe,j) = 1 ~x{e) 



1 



-x(e) and 



for all e = (i,j) G E, and 



y(i',i") = 1- - ^2 x(e) for all i G V opt . 



eeS(i) 



We will now show that this solution is indeed in the perfect matching polytope given by the 
constraints Mm, fcfh and ([8]) of the linear program (M) in Section^] (where the variables are here 
called y instead of x). For nodes p e ^, the degree bound constraints ffity follow directly from the 
definition of y (there are three edges incident on p e ^ with y- values ^xie), \x{e) and 1 — x(e), 
which sum to 1). For the other nodes, constraint (pi) follows directly from the degree bound 



constraints (13) o r (|14[ ) in the 2MO instance and the definition of y. Constraints (|8j) follow directly 
from constraints (|16|). 

We will now prove that constraints ([7]) also hold for all subsets of nodes of odd cardinality in 
our reduction. Let S' be such a subset. We consider four cases. 

• (Case 1) \{i',i"} n S'\ = 1 for some i S V. Note that we have edges (i',p e ,i) and (i" ,p e ,i) in 
the reduction both of which have y-value ^x(e), and of which exactly one will be in 5(S'). 
Furthermore, (i',i") is in 5(S') if i is in V opt . Therefore J2e'eS(S') ^( e ') - ^2ee5(i) l x ( e ) = 1 
if i G V man by the degree bound (13). Similarly ^ e 'e<5(5') ^( e ' 

2 ^2e£6(i) X ( e )) 



> 



E 



1 if i G V^pt by the degree bound (14) 



l 

eS<5(i) 2 



x(e) + (1 



(Case 2) For some e = (i,j) G E, p e ,i G S',p e ,j S' and {i',i"} D S' = 0. Let p 
T, e >eS(s>) y( e ') ^ V(P> *') + y& i ") + y(P^Pe,j) = \x{e) + \x{e) + 1 - x(e) = 1. 



p e i. Then 
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(Case 3) For some e = (i,j) G E, p e< i G S',p e j 5' and {j',j"} C 5'. Let p = p e j. Then 
X)e'e<J(S') y( e ') - y&> J') + J/(P' ■?'") + ?/(P' Pe,i) = \ x ( e ) + ^(e) + 1 - x(e) = 1. 

(Case 4) We may now assume that 5" is such that \{i' , i"} n 5'| is even for all i G V, and that 
{i', i"} G S' and {j',j"} n 5' = if p e> j G 5' and p e j G" S", because otherwise we are in one of 
the previous cases. Define 5 = {i G V : i! G 5' and i" G 5'} and F = {e = {i,j} G -E : p e) j G 
S" and p e j 5"}. Note that the previous argument implies that F C <5(S). 

Consider e = (i, J) G S(S) in the 2MO instance, and assume without loss of generality that 
i G S. By definition of 5, this means {j',j"} n S" = 0. We consider e G F and e ^ F 
separately. First of all, assume e G F. Since we are not in the previous cases this means that 
p e ,i G 5' and p e j S' . So for each such e in the 2MO instance, we have (p e ,i,Pe,j) G S(S') in 
the matching instance, with an y- value of 1 — x(e). Second, assume e G" F. By definition of 
F, we know that either p e ^ and p e j are both in 5", or both not in 5'. So for each such e in 
the 2MO instance, we have either {(i',p e ,i), (i",Pe,i)} f~ S(S') or {(j',p e ,j), (j",Pe,j)} <= 5(5") 
in the matching instance, each of which carry a total y- value of x(e). 

We thus get E e ' e «5(5') vi e ') ^ E ee <5(5)\F x ( e ) + EeeM 1 ~ x ( e ))- We tnen note that l-^l is 
equal to the number of nodes of the type p e< i in S' , which implies that the parity of \F\ and 
\S'\ are always the same, as the other nodes in S' appear in pairs. Thus since \S'\ is odd, \F\ 
is odd, and we have E e 'e-5(5') 2/( e ') ^ J2ee8(s)\F x ( e ) + Ylecpi 1 ~ x ( e )) > l h Y the feasibility 



of x for constraints (15). 



We conclude the proof by noting that a perfect matching in the constructed instance corresponds 
to the 2M0 solution consisting of all edges e = (i,j) for which (p e ,i,Pe,j) is not in the perfect 
matching solution. ■ 
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